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The quantum violation of the Leggett-Garg (LG) inequality is less surprising than that of Bell- 
type inequalities or contextuality inequalities because the latter two involve correlations between 
commuting observables, and hence do not lead to signaling, whereas the former involves 'signaling in 
time'/memory. We derive a general Bell-type inequality, encompassing both the spatial and temporal 
variants, where the classical bound is determined by the maximum violation of the inequality allowed 
by the signaling in the correlations. This is used to make precise the notion of non-classicality, and 
in particular, to explain the sense in which some quantum violations of the LG inequality are 
classical, though sufficiently large quantum violations will be non-classical (thereby demonstrating 
the quantumness of a qubit). We indicate that this weakened concept of non-classicality also entails 
features like intrinsic randomness, no-cloning, monogamy of correlations, etc., though to a lesser 
degree than in non-signaling, nonlocal correlations. We find that quantum invasiveness is, apart 
from nonlocality and contextuality, a fundamental nonclassical resource in QM, related ultimately 
to non-commutativity. Quite generally, a theory may be called non-classical if it entails effects that 
cannot be explained by the signaling available within the theory. Finally, we draw attention to an 
analogy between non-classicality and the meta-mathematical concept of (Godel) incompleteness. 



Introduction. One of the most remarkable features of 
quantum mechanics (QM) is nonlocality, characterized 
by the violation of Bell-type inequalities [TJ [2] . Leggett 
and Garg (LG) [3] proposed the temporal version of a 
Bell-type inequality that is satisfied by all noninvasive- 
realist theories (For related variants, cf. Ref. [4 ). 'Re- 
alism' is the assumption that the given system Q pos- 
sesses determinate properties prior to measurement [5]. 
'Noninvasiveness' is the assumption of measurability of 
a system without disturbing the subsequent evolution of 
its possessed value. Thus, measurement only reveals a 
pre-existing value. 

Suppose the quantities a or a' are measured at time 
tA and b or b' at time > tA, where the observables 
satisfy a, a', b, b' = ±1. Under the stated assumptions, 
a(b + V) +a'(b- V) = ±2, so that 

\(ab) + (ati) + (a'b) - (a'b')\ < 2, (1) 

which is the LG inequality. Violation of the LG in- 
equality implies the negation of the assumption of non- 
invasiveness and/or realism. Note that Eq. is a 2-time 
(tA,tB) variant of the 4-time version that was the origi- 
nal LG inequality, and is essentially the same, except to 
bring the analogy with the Bell inequality closer [6]. 

In the microscopic scale, because of non-commutativity 
of quantum observables, measurement in general 'col- 
lapses' the quantum state and is thus invasive. The 
LG inequality was thus originally proposed for macro- 
scopic systems, which can in principle be measured non- 
invasively. 

The assumptions behind the derivation of the Bell in- 
equality are localism and realism. As a classical theory 



is necessarily local and realist, a violation of Bell's in- 
equality implies non-classicality. Likewise, as a classical 
theory is necessarily non-contextual and realist, a vio- 
lation of a contextuality inequality [7j also implies non- 
classicality. However, the violation of the LG inequality 
does not entail non-classicality, since a classical theory is 
not necessarily non-invasive. 

Invasiveness implies a disturbance or signal [5j carried 
forward in time from one measurement to another on the 
same particle, such that the probability distribution of 
a subsequent measurement depends on the choice made 
earlier [9]. For sufficiently large signal, an invasive-realist 
classical mechanism can presumably be used to violate 
the LG inequality. Its violation is thus less surprising 
than that of Bell-type inequalities, which involve no sig- 
naling. 

From this perspective, the violation of the LG inequal- 
ity is not non-classical unless the degree of violation is 
shown to be larger than can be explained by a classical 
mechanism that uses the signaling in the correlations. We 
note that in the context of the LG inequality, 'signaling' 
means 'signaling in time' or carry-forward of memory. It 
does not contradict the no-signaling principle that holds 
between geographically separated particles, and hence is 
not prohibited by special relativity. 

The problem thus arises of clarifying the sense, and 
scope, in which violation of the LG inequality is nonclas- 
sical. More generally, one wishes to extend the notion of 
non-classicality beyond contexts of commuting observ- 
ables (as required in a setting appropriate to test a Bell- 
type or contextuality inequality) to correlations between 
sequential non-commuting observables, thus determining 
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the full scope of non-classicality. This is the purpose of 
this article. Our main result is the derivation of a gen- 
eralization of the Bell or LG or contextuality inequality 
which is a more faithful witness of nonclassicality when 
signaling/memory is present. 

Disturbance. A set of correlations P = P(x, y\a, b) 
between two parties A and B (where x, y and a, b are 
possible discrete measurement outcomes and settings, re- 
spectively) can be described by a joint distribution (JD) 
if and only if it has a deterministic hidden variable (HV) 
description P(x,y\a,b) = J p(X)P(x\a, X)P(y\b, X)dX 
[TU] , For the case where a, b = 0,1 and x, y = ±1, 
this is equivalent to the satisfaction of the inequality 
A(P) < 2, where A(P) = P(x = y\00) + P(x = 
2/(10) ~ P(x = y\U) + P(x = 2/|01) - P(x ? 2/|00) - 
P(x ^ y\10) + P(x ^ 2/| 11) - P(x ^ 2/101), where 
P(x = y\ab) = P(+l,+l|a,6) + P(-l,-l|a,&) and 
P(x ^ y\ab) = P(+l,-l|o,6) + P(-l, +l|o, 6). When 
a and b are sequential measurements on the same parti- 
cle, then A(P) is the left-hand side of Eq. 0. When a 
and b belong to different particles, the above inequality 
is the usual CHSH inequality [2]. 

Violation of the CHSH/LG inequality, or equivalently, 
a lack of JD, constitutes what we call a disturbance. Dis- 
turbance may or may not be signaling (the property that 
measurement at A reveals the B settings or vice versa). 
In QM, if the random variables denoting measurements, 
A and B, belong to spatially separated particles (as is 
the case in a Bell setting), then the disturbance is non- 
local and non-signaling. If a and b pertain to the same 
particle (as in an LG setting), then the disturbance is 
local and signaling. In local QM, this disturbance arises 
on account of non-commutativity of observables. For in 
this case, the product of projectors to these operators 
II^-i-iIIb^-i-i are not hermitian, and hence do not de- 
fine a valid probability distribution in terms of their ex- 
pectation values. Non-signaling disturbance also has its 
ultimate roots in non-commutativity. 

Disturbance implies that the communication cost for 
simulating P is greater than 0, i.e., it is impossible for two 
(or more) players, given pre-shared randomness, to recre- 
ate the given correlations using local strategies and with- 
out any communication [11 . A lower bound on the aver- 
age communication cost to simulate P may be obtained 
as follows [12] • We may express P as a convex combi- 
nation of deterministic strategies P = XijXii 

where d Aj is the Aj-th deterministic protocol requiring a 
communication cost of cj = 0, 1, 2, ■ • • bits to implement. 
A deterministic protocol is of the form P(x, y\a, b) = 
S f(a,b) SV g(a,b)i wheie fi a M (#0,6), resp.) specifies Al- 
ice's (Bob's, resp.) outcome measurements a and b (with 
the convention that in the binary case or 1 outcomes 
represent ^fl). If / = f(a) and g = gib), then no commu- 
nication is needed, and this strategy defines local corre- 
lations. A convex combination of local strategies clearly 



defines a classical protocol. If / = f(a) and g = g(a, b) 
then Alice needs to communicate 1 bit to Bob indicating 
her choice of a. 

For example, the deterministic strategy Sq -5 " 6 defined 
by P(x,y\a,b) = SqS^^ has signaling from Alice to Bob, 
and maximally violates the CHSH inequality Similarly: 
S^ b defined by P(x,y\a, b) — SfS^. The average com- 
munication cost of P is C(P) = min{ gj .} J2j Qj c j> where 
qj = J2\ 9Aj ■; that is, the average communication cost, 
minimized over all possible decompositions of P into de- 
terministic strategies. 

Let Aj = maxA . 

A(d A 0- One can show that C(P) > 
A j [ P ^_ A A " cj, where J is the integer j(^= 0) that maximizes 
(Aj -A )/c r 

Particularizing to the CHSH or LG inequality, we know 
that the maximum local bound is given by Ao = 2. We 
find that J = 1, with Ai = 4, c\ = 1 bit, so that the 
above lower bound becomes: 

C(P) > ^A(P) - 1. (2) 

For maximal violation of the LG inequality, which is 2y/2, 
we find the required communication cost C(P) = \/2 — 
1 0.41. (It may be remarked that Ref. [IT] shows that 
1 bit of communication suffices to reproduce correlations 
for an arbitrary pair of projective measurement). In the 
CHSH case, where P is non-signaling, the lower bound 
([2]) is also sufficient to reproduce P [12*] . 

Signaling. In strategy S" - *' 6 , the maximum signaling 
from Alice to Bob (when she chooses a = or a = 1 
with equal probability and Bob always chooses b — 1) is 
also 1 bit. The analogous strategy with signaling from 
Bob to Alice is S^ b given by P = 5%. b 6%. Suppose the 
correlations P contain signaling from Alice to Bob (but 
not the reverse), then we can interpret the correlations 
temporally with Bob measuring the state measured by 
Alice. Provided the correlations are interpreted locally 
(as it is in the LG situation) , signaling represents memory 
capacity and then there is no contradiction with the no- 
signaling principle or special relativity. 

Let Sj denote the maximum signaling allowed by any 
j-bit one-way signaling strategy. We may regard Al- 
ice's input random variable A, and Bob's output ran- 
dom variable Y as constituting a Markov chain A —> Y 
[13] . By the data processing inequality, cj = H{A) > 
I max (A : Y) — Sj, where H{A) is the Shannon entropy 
and I maK (A : Y) denotes the maximum mutual informa- 
tion I (A : Y) = H(A) - H{A\Y) [13]. By convexity, a 
probabilistic protocol over deterministic strategies with 
at most Cj bits of communication cost will entail signaling 
S(P) of a most Cj bits, and so 

S(P) < C(P). (3) 

In particular, a uniform mixture of Sq - *' 6 and S" - ^ 6 yields 
the PR box [14] . which is non-signaling but maximally 
violates the CHSH inequality. 
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Specializing to the case of two measurements and two 
outcomes, given correlations P, let Pq (P-f, resp.) rep- 
resent the probability that Bob, measuring observable 
b, finds y = +1 when Alice measures observable a = 
(a = 1, resp.). The necessary condition for signaling is 
s = maxf, |Pq — Pf | > 0, where b. The signalled informa- 



tion S(P) received by Bob is quantified by the mutual 
information I(A : Y) maximized over Alice's and Bob's 
choices. Letting a and (3 = 1 — a denote the proba- 



bilities with which Alice chooses a or a', Q 
P = aP b + f3P b and Q = aQ\ + f3Q\, 



b — 
3 ~~ 

explicitly 



1 - P? 



S(P) 



max 

b. a 



[H(a) + aP b log (aP b /P) + f$P\ log (pP h jP) + aQ b Q log (aQ b /Q) + @Q\ log ( PQ\/Q)} 



(4) 



The above may straightforwardly be generalized when 
Alice and Bob have more than 2 settings. It may be 
verified that as expected, S'(P) = when P^ = Pf , and 
is, maximally, S(P) = 1 when P^ — and P\ = 1 or vice 



versa, where b is the value of b that maximizes 5(P). 

For a qubit in QM, the correlations for sequential mea- 
surements are given by 



P(x,y\&,b) = Tr ( 1 + 2 Vb 1 + 2 X % l -±p\ = 1 + ^Tr(ap) + ^Tr (bp) + °f Tr({a,b}p) + V -Tr(abap). (5) 



r 



Without loss of generality, let b = b. From Eq. 
condition for signaling is 



5k, the 



^\Po-Pi 



1. 



-Tr 



(^aba — a'ba'^j p 



> 0, 



(G) 



Tr 



(^abap 



which is bounded above in QM as: \s\ < \ 

j Tr (^a'ba! p^j < |. Let the density operator obtained 

by measuring a (resp., d') on \ip) be po (resp., pi). The 
above bound on signaling strength s can also be stated in 
terms of trace distance: r = |||po — Pill > 0. Intuitively, 
we expect to maximize s in Eq. ^ or r when is an 
eigenstate of a, and a! maximally fails to commute with a 
(i.e., the two observables form a pair of mutually unbiased 
bases). By a similar argument as used to bound s, trace 
distance satisfies |||po — Pill < |- This inequality, as well 
as ([6]), are saturated for the settings: 



\tp) = |0>;d = a z ,a 



(7) 



It follows from Eq. (J6J> , [a, b] ^ and/or [a',b] ^ is 
necessary for signaling, or equivalently, an interference 
between the eigenstates of the operators a and a' [3]. If 
Alice and Bob are geographically separated, and p is a 
two-qubit state, Eq. ^ still holds, with the represen- 
tations a <- a ® I, a' <- a' ® I, S •<— I ® b, V <- I ® V . 
We have [&, 6] = [d', b] = 0, and we obtain the usual no- 
signaling. It may be noted that, even in the local case, 



there is no backward signaling from Bob to Alice, i.e., 
Alice's outcome probabilties as derived from Eq. ([5| are 
independent of Bob's settings, as expected. 

From Eq. we obtain the correlator (ab) = 

^2 X xyP(x, y) = |({dS}) = a ■ b, where a — a ■ a and 

b = b ■ a. We note that the correlator is independent of 
temporal order, at least for qubits, even though there is 
a signaling from Alice to Bob. 

Thus, the correlations are the same as that obtained by 
von Neumann measurements on singlets, and it follows 
that the LG inequality is microscopically violated, and 
that the Cirelson bound [H] for Bell-type inequalities 
also holds for the LG inequality ([T]) [8]. 

If the state is the maximally mixed J/2, then 
P\\ cx Tr (aba - a'ba') = 0, since a 2 = (a') 2 = 
any qubit observable with spectrum ±1. Similarly r = 
0. Interestingly, because the correlators (ab) are state- 
independent, this state will nevertheless violate the LG 
inequality maximally. 

Main result: Non-classicality. In the classical world, 
clearly Alice and Bob can violate Bell-type inequalities 
provided she can signal to him. Intuitively, P is classical 
if it allows signaling strong enough to accommodate the 
inequality violation it entails. A deterministic strategy 
like S"~ >b is classical in the sense that the maximum sig- 
naled information allowed in it, which is 1 bit, can be 
used to augment a local strategy to re-produce the dis- 
turbance property of S^ b , such as maximally violating 
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the CHSH inequality. 

(1) General criterion: If the maximum signalable in- 
formation in P exceeds the minimum communication 
cost for reproducing the disturbance, then clearly the in- 
equality violation can be simulated classically given the 
same amount of signaling. (We don't require that Nature 
should use the same strategies.) Let C\(P) = |A(P) — 1. 
Note that C(P) = C A (P) for non-signaling P, and in gen- 
eral C(P) > C A (P), as follows from Eq. From this 
perspective, the necessary and sufficient condition for the 
classicality of P, even in the presence of signaling, is: 



S(P) > C A (P) 



(8) 



Eq. ^ generalizes both the Bell and LG inequalities to 
give a general criterion for classicality. 

(2) Bell, LG and contextuality inequalities in QM: In 
the spatial (i.e., usual) Bell inequality scenario, by no- 
signaling S{P) = 0, so that Eq. (|8) yields C A (P) < 0, 
which is just the usual CHSH inequality. By the crite- 
rion pj, any violation of the CHSH inequality is thus 
necessarily non-classical, as expected. 

For sequential measurements of a qubit, S(P) in Eq. 
Q is bounded above by the Holevo quantity [13] \ = 
S(ap + /3pi) — aS(po) — /3S(p\). Direct substitution 

using settings (7) in (4) yields H(a) + alogfj^jj + 

log fiq^M > which, when maximized, yields S(P) = 
log(5) — 2 ps 0.32 bits at a = 3/5. This is the maximum 
signaling quantumly possible from Alice to Bob [8], and 
saturates the Holevo bound. Since the minimum commu- 
nication cost for a maximal quantum violation (of 2-\/2) 
of the LG inequality is about 0.41 bits, violation of the 
inequality ^ follows. Thus maximal quantum violation 
of the LG inequality is indeed non-classical: a communi- 
cation cost of at least 0.41 — 0.32 = 0.09 bits is required 
on average over the signaling provided by the invasive- 
ness, in order to violate the LG inequality in a classical 
simulation. 

On the other hand, non-maximal violations of the LG 
inequality are not necessarily non-classical, as seen from 
Figurefl] where a, b, a' and b' are oriented in the xz plane, 
and separated by angular intervals 9; the initial state is 
the +1 eigenstate of a. 

Setting C A (P) < log(5) - 2, we find that A(P) < 
2(log(5) — 1) ps 2.64, a tighter version of the LG inequal- 
ity, that gives a sufficient condition for nonclassicality. 
That it is not necessary is clear from Figure [T] in the 
range approximately given by 8 <E [0.99, 1.03]. 

Eq. Q suggests that one can define a measure of 
non-classicality, the normalized communication cost : 
r) = min{0, (C A (P) - S(P))/C A (P)}, with the conven- 
tion r\ — if Ca(P) = 0. For correlations that are 
classical, 77 = 0. Note that the maximally nonlocal and 
signaling correlations S" - * 6 and S" <_h are classical be- 
cause signaling can precisely account for the nonlocality. 



FIG. 1: For the range of angles 9 given, the LG inequality is 
violated, with normalized LG inequality violation (A(P)/Ao, 
small-dashed line) exceeding 1. In the region where signaling 
(S(P) with maximization over b restricted to {&,&'}, large- 
dashed line) exceeds the communication cost (Ca, dotted 
line), above 6 > 1.08, the LG inequality violations in QM 
are not non-classical, according to Eq. B. The plain line 
marks the Holevo bound (asymptotically accessible signal). 



For any non-signaling violation of the CHSH inequality, 
f] = 1, indicating maximal nonclassicality; while for the 
maximal quantum violation of the LG inequality, we find 
r\ ps 0.09/0.41 fa 0.22. In this sense, the maximal quan- 
tum violation of the CHSH inequality is more nonclas- 
sical than the maximal quantum violation of the LG in- 
equality. Interestingly, as we saw earlier, for the max- 
imally mixed state S(P) — 0, implying 77 = 1. Thus 
the maximal violation of the LG inequality by the max- 
imally mixed state is more non-classical than that by a 
pure state. In contextuality inequalities [7], C A (P) > 
in order to supply context information, but S(P) = 
because observables in a correlator are pairwise commu- 
tative, so that correlations are maximally non-classical. 

Intuitively, a single qubit is a non-classical object. 
However, Bell has shown that the outcome of a single pro- 
jective measurement on any qubit state can be classically 
simulated with a hidden variable (HV) model [TB"] . Fur- 
ther, a proof of non-classicality via contextuality requires 
dimension greater than 2, which thus does not apply to a 
qubit subjected to projective measurements. Our result 
above shows that correlations based on sequential mea- 
surements on a qubit are nonclassical, thereby making a 
qubit nonclassical. 

Instrinsic randomness, monogamy, etc. The above re- 
sults apply not just to QM but to any theory in which 
correlations P can be defined. A theory is classical if 
Vp(»?) = 0, while it is nonclassical if 3p(??) > 0. It is 
known that fundamental randomness, Heisenberg uncer- 
tainty, privacy and monogamy of correlations, and the 
impossibility of perfect cloning are necessary features in 
non-signaling nonlocal theories |17j . In theories of in- 
termediate nonclassicality (i.e., < i] < 1), we expect 
these features to diminish in strength but not vanish. 
We briefly illustrate this point here. For example, vio- 
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lation of the LG inequality by a qubit can be used to 
perform a computation more efficiently than a classical 
bit [6]. Suppose Alice and Bob share a PR box defined 
by a ■ b = x ® y. With perfect cloning by Bob, one has 
a -b' = x®y', so that: a- (b(B b') — y(By', from which Bob 
determininstically obtains Alice's input. More generally, 
if he can clone it only with probability p (i.e., his clone 
is a mixture of a PR box with probability p and a ran- 
dom outcome state), the signal received by Bob is seen 
to I (A : Y) = 1 — H((l — p)/2.), a monotonic function 
in the range p S [0, 1], indicating that with greater bits 
signaled, there is lesser no-cloning. 

Consider the correlation Q(p) = pS e ^ b + (l~p)S^ b . 
Defining intrinsic indeterminancy / = min{p, 1 — p} 
and signal s by Eq. ([6]), we find s + 21 = 1, imply- 
ing a trade-off between signaling and randomness (More 
generally, s + 21 > 1 [18 ). Since C A (Q) = 1, and 
s = 1 5(Q) = 1, by our definition, s < 1 indicates 
non-classicality, which is equivalent to / > 0. Thus, our 
signaling-based definition of non-classicality is consistent 
with the traditional notion associating non-classicality 
with intrinsic randomness, etc. 

As noted earlier, the disturbance in the local sector 
is an obvious consequence of non-commutativity. The 
signaling also is a consequence of non-commutativity, as 
follows from Eq. Thus the nonclassicality of quan- 
tum invasive measurements, like that of nonlocality and 
contextuality, also follows ultimately from quantum non- 
commutativity. It is different from them in that it in- 
volves correlators between non-commuting observables. 
This shows that quantum invasiveness should be thought 
of as another basic nonclassical feature, that is at par 
with contextualtiy and nonlocality. 

Discussions and Conclusions. We now mention briefly 
the similarity of our main result to a theorem in meta- 
mathematics, the study of mathematics using mathemat- 
ical tools. (More generally, a meta-theory is a theory 
about a theory). At the moment, the analogy is admit- 
tedly very sketchy, but we feel that it offers potential 
clarity in the way we think about physical theories by 
permitting an 'outside view'. A well-known metamathc- 
matical result is that of incompleteness due to Godel [TH] , 
according to which any axiomatization A of arithmetic, 
if it is consistent, then it is incomplete, in the sense of 
there being truths (theorems) expressible in A but not 
provable within A. An existential proof of the result is 
that the set of theorems in A has the cardinality of the 
continuum, i.e., it is uncountably large, whilst the num- 
ber of proofs is only countably infinite. 

To be precise, the concept of communication cost is 
meta-theoretic, as therefore are Eqs. ^ and ([8]), which 
are statements about QM. Given theory T, predictions in 
it are, technically, theorems, while signaling indicates a 
sequence of causes and effects, which is like a train of logi- 
cal inferences, and thus constitute a proof. Completeness 
entails that all predicted effects in the theory are explain- 



able via manifest signaling. Consistency entails that only 
predicted effects are explainable via signaling. Thus non- 
classicality corresponds to incompleteness in that there 
exist effects not attributable to signals. A little thought 
shows that it is related to EPR incompleteness [S]. 

The connection to Godel's result is more apparent in 
its information theoretic version, due to Chaitin |20) . 
according to which, a theorem cannot be derived from 
axioms that contain less information than the theorem. 
In the present context, nonclassicality is analogously the 
case when a theory (a set of theorems) cannot be ex- 
plained from initial conditions (axioms) and given rules 
of inference (the available signaling). This suggests that 
nonclassicality is not pathological but a natural state of 
affairs in 'theoryspace'. 
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